Picture for Gongshen Liu

Gongshen Liu

HLL: Can Agents Cross Humanity's Last Line of Verification?

Add code
Jun 01, 2026
Viaarxiv icon

MineExplorer: Evaluating Open-World Exploration of MLLM Agents in Minecraft

Add code
May 29, 2026
Viaarxiv icon

Mobile-Aptus: Confidence-Driven Proactive and Robust Interaction in MLLM-based Mobile-Using Agents

Add code
May 27, 2026
Viaarxiv icon

OS-SPEAR: A Toolkit for the Safety, Performance,Efficiency, and Robustness Analysis of OS Agents

Add code
Apr 27, 2026
Viaarxiv icon

SEARL: Joint Optimization of Policy and Tool Graph Memory for Self-Evolving Agents

Add code
Apr 09, 2026
Viaarxiv icon

Up to 36x Speedup: Mask-based Parallel Inference Paradigm for Key Information Extraction in MLLMs

Add code
Jan 27, 2026
Viaarxiv icon

Do Latent Tokens Think? A Causal and Adversarial Analysis of Chain-of-Continuous-Thought

Add code
Dec 25, 2025
Viaarxiv icon

Say One Thing, Do Another? Diagnosing Reasoning-Execution Gaps in VLM-Powered Mobile-Use Agents

Add code
Oct 02, 2025
Viaarxiv icon

Agent-ScanKit: Unraveling Memory and Reasoning of Multimodal Agents via Sensitivity Perturbations

Add code
Oct 02, 2025
Viaarxiv icon

On the Adaptive Psychological Persuasion of Large Language Models

Add code
Jun 07, 2025
Viaarxiv icon